Search CORE

eScholarship - University of California

Efficient Sparse Coding in Early Sensory Processing: Lessons from Signal Recovery

Author: A Lörincz
A Lörincz
A Lörincz
AJ Bell
András Lörincz
B Liu
B Natarajan
B Szatmáry
B Widrow
BA Olshausen
BA Olshausen
BA Olshausen
BT Vincent
C Cadieu
C Chennubhotla
D Cai
D Needell
D Needell
DCV Essen
DJ Graham
DL Donoho
DL Ringach
DW Dong
E Doi
E Doi
EJ Candès
EJ Candès
EJ Candès
EP Simoncelli
GC DeAngelis
GH Golub
Gábor Szirtes
H A
H Muehlenbein
HB Barlow
I Szita
IT Jolliffe
J Lücke
JA Cardin
JA Tropp
JJ Atick
JP Jones
Lyle J. Graham
M Rehn
M Riesenhuber
P Berkes
P Földiák
P Lennie
PT de Boer
RW Rodieck
RY Rubinstein
S Mallat
SB Laughlin
W Dai
YC Pati
Z Zhou
Zsolt Palotai
Publication venue: Public Library of Science
Publication date: 01/01/2011
Field of study

Sensory representations are not only sparse, but often overcomplete: coding units significantly outnumber the input units. For models of neural coding this overcompleteness poses a computational challenge for shaping the signal processing channels as well as for using the large and sparse representations in an efficient way. We argue that higher level overcompleteness becomes computationally tractable by imposing sparsity on synaptic activity and we also show that such structural sparsity can be facilitated by statistics based decomposition of the stimuli into typical and atypical parts prior to sparse coding. Typical parts represent large-scale correlations, thus they can be significantly compressed. Atypical parts, on the other hand, represent local features and are the subjects of actual sparse coding. When applied on natural images, our decomposition based sparse coding model can efficiently form overcomplete codes and both center-surround and oriented filters are obtained similar to those observed in the retina and the primary visual cortex, respectively. Therefore we hypothesize that the proposed computational architecture can be seen as a coherent functional model of the first stages of sensory coding in early vision

CiteSeerX

Publikationsserver der Universität Tübingen

ELTE Digital Institutional Repository (EDIT)

Catalyzing next-generation Artificial Intelligence through NeuroAI

Author: Bengio Y
Boahen K
Botvinick M
Chklovskii D
Churchland A
Clopath C
DiCarlo J
Escola S
Ganguli S
Hawkins J
Koulakov A
Körding K
LeCun Y
Lillicrap T
Marblestone A
Olshausen B
Pouget A
Richards B
Savin C
Sejnowski T
Simoncelli E
Solla S
Sussillo D
Tolias AS
Tsao D
Zador A
Ölveczky B
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 03/03/2023
Field of study

Neuroscience has long been an essential driver of progress in artificial intelligence (AI). We propose that to accelerate progress in AI, we must invest in fundamental research in NeuroAI. A core component of this is the embodied Turing test, which challenges AI animal models to interact with the sensorimotor world at skill levels akin to their living counterparts. The embodied Turing test shifts the focus from those capabilities like game playing and language that are especially well-developed or uniquely human to those capabilities - inherited from over 500 million years of evolution - that are shared with all animals. Building models that can pass the embodied Turing test will provide a roadmap for the next generation of AI

Spiral - Imperial College Digital Repository

Consequences of converting graded to action potentials upon neural information coding and energy efficiency

Author: A Borst
A Destexhe
A Hasenstaub
A Manwani
A Manwani
A Treves
AA Lazar
AA Lazar
AA Lazar
AL Hodgkin
AS French
B Aguera y Arcas
B Sengupta
B Sengupta
B Sengupta
B Sengupta
B Sengupta
BA Olshausen
BC Carter
Biswa Sengupta
C Koch
C Koch
C Shannon
CC Chow
D Attwell
D Desmaisons
D Lee
DM MacKay
DT Gillespie
E Marder
E Salinas
E Schneidman
E Skaugen
EM Izhikevich
F Theunissen
FA Dodge Jr
G Laurent
G Marsaglia
GG de Polavieja
H Alle
H Alle
J Haag
JA White
JA White
JC Rekling
JC Skou
JD Victor
JD Victor
JE Niven
JE Niven
JE Niven
JE Niven
JE Niven
Jeremy Edward Niven
K Koch
M Juusola
M Matsumoto
M Pinsker
M Stemmler
MB Kennel
ME Larkum
MH Kole
MN Shadlen
MS Grubb
MV Srinivasan
NJ Lenn
O Bernander
Olaf Sporns
PG Lillywhite
PN Steinmetz
R Guttman
R Sarpeshkar
RA DiCaprio
RRdR van Steveninck
S Curti
S Laughlin
SB Laughlin
Simon Barry Laughlin
SP Strong
SR Williams
TJ Gawne
V Prelov
W Singer
Y Shu
ZF Mainen
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

Information is encoded in neural circuits using both graded and action potentials, converting between them within single neurons and successive processing layers. This conversion is accompanied by information loss and a drop in energy efficiency. We investigate the biophysical causes of this loss of information and efficiency by comparing spiking neuron models, containing stochastic voltage-gated Na+ and K+ channels, with generator potential and graded potential models lacking voltage-gated Na+ channels. We identify three causes of information loss in the generator potential that are the by-product of action potential generation: (1) the voltage-gated Na+ channels necessary for action potential generation increase intrinsic noise and (2) introduce non-linearities, and (3) the finite duration of the action potential creates a ‘footprint’ in the generator potential that obscures incoming signals. These three processes reduce information rates by ~50% in generator potentials, to ~3 times that of spike trains. Both generator potentials and graded potentials consume almost an order of magnitude less energy per second than spike trains. Because of the lower information rates of generator potentials they are substantially less energy efficient than graded potentials. However, both are an order of magnitude more efficient than spike trains due to the higher energy costs and low information content of spikes, emphasizing that there is a two-fold cost of converting analogue to digital; information loss and cost inflation

Public Library of Science (PLOS)

Open Access Repository of IISc Research Publications

Sussex Research Online

FigShare

Parametric study of EEG sensitivity to phase noise during face processing

Author: A Delorme
A Delorme
AB Sekuler
AB Sekuler
Allison B Sekuler
AV Oppenheim
B Jemel
B Rossion
BA Olshausen
BS Tjan
C Jacques
C Jacques
C Joyce
C Pernet
CA Olman
Cyril R Pernet
DA Jeffreys
DA Jeffreys
DM Tucker
EP Simoncelli
FA Kingdom
FA Wichmann
G Felsen
G Rainer
G Rainer
GA Rousselet
GA Rousselet
GA Rousselet
GA Rousselet
GA Rousselet
GA Rousselet
Guillaume A Rousselet
J Bullier
J Bullier
J Drewes
J Gold
J Portilla
JJ DiCarlo
JS Husk
K Bötzel
K Grill-Spector
K Tanaka
KL Hoffman
LC Loschky
LT DeCarlo
MC Morrone
MG Philiastides
MG Philiastides
MG Thomson
MG Thomson
ML Smith
MM Murray
NC Rust
NK Logothetis
O Hauk
P Sehatpour
Patrick J Bennett
PG Schyns
PG Schyns
R VanRullen
RJ Itier
RJ Itier
RJ Itier
RJ Itier
RR Wilcox
S Bentin
S Bentin
SA Hillyard
SC Dakin
SJ Thorpe
T Tanskanen
T Tanskanen
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2008
Field of study

Background: The present paper examines the visual processing speed of complex objects, here faces, by mapping the relationship between object physical properties and single-trial brain responses. Measuring visual processing speed is challenging because uncontrolled physical differences that co-vary with object categories might affect brain measurements, thus biasing our speed estimates. Recently, we demonstrated that early event-related potential (ERP) differences between faces and objects are preserved even when images differ only in phase information, and amplitude spectra are equated across image categories. Here, we use a parametric design to study how early ERP to faces are shaped by phase information. Subjects performed a two-alternative force choice discrimination between two faces (Experiment 1) or textures (two control experiments). All stimuli had the same amplitude spectrum and were presented at 11 phase noise levels, varying from 0% to 100% in 10% increments, using a linear phase interpolation technique. Single-trial ERP data from each subject were analysed using a multiple linear regression model. Results: Our results show that sensitivity to phase noise in faces emerges progressively in a short time window between the P1 and the N170 ERP visual components. The sensitivity to phase noise starts at about 120–130 ms after stimulus onset and continues for another 25–40 ms. This result was robust both within and across subjects. A control experiment using pink noise textures, which had the same second-order statistics as the faces used in Experiment 1, demonstrated that the sensitivity to phase noise observed for faces cannot be explained by the presence of global image structure alone. A second control experiment used wavelet textures that were matched to the face stimuli in terms of second- and higher-order image statistics. Results from this experiment suggest that higher-order statistics of faces are necessary but not sufficient to obtain the sensitivity to phase noise function observed in response to faces. Conclusion: Our results constitute the first quantitative assessment of the time course of phase information processing by the human visual brain. We interpret our results in a framework that focuses on image statistics and single-trial analyses

Springer - Publisher Connector

Edinburgh Research Explorer

Enlighten

Age-related delay in information accrual for faces: Evidence from a parametric, single-trial EEG approach

Author: A Delorme
A Delorme
A Gazzaley
A Gazzaley
A Nakamura
A Peters
A Peters
A Peters
A Peters
A Peters
AB Sekuler
AB Sekuler
AG Leventhal
Allison B Sekuler
AV Oppenheim
B Rossion
B Rossion
BA Olshausen
C Habak
C Thomas
Carl M Gaspar
CE Schroeder
CL Grady
CL Grady
CL Grady
Cyril R Pernet
D Payer
DC Park
DH Mathalon
DI Perrett
DM Tucker
DP Hanes
E Rodriguez
E Roudaia
EM Pfutze
ER Sowell
F Barcelo
F Di Russo
F Schieber
GA Rousselet
GA Rousselet
GA Rousselet
GA Rousselet
GA Rousselet
Guillaume A Rousselet
H Duan
H Wang
H Wiese
I Boutet
IJ Deary
J Billino
J Bullier
J Gold
J Klopp
J Yordanova
JB Rowe
JD Hinman
Jesse S Husk
JJ DiCarlo
JJ Foxe
JS Husk
JW Page
K Bötzel
L Chaby
L Chaby
L Marner
LA Lott
LR Betts
LR Betts
M Kutas
MG Philiastides
MG Thomson
ML Smith
MT Schmolesky
N Raz
N Wild-Wall
O Hauk
P Kovesi
P Kovesi
Patrick J Bennett
PD Spear
PG Schyns
PG Schyns
PJ Bennett
PJ Bennett
R Ceponiene
R Sekuler
R Sekuler
RE Dustman
RJ Itier
RJ Itier
RJ Itier
RJ Itier
RM Crum
RR Wilcox
S Makeig
S Tobimatsu
S Watanabe
S Yu
SC Dakin
SW Davis
SW Govenlock
T Allison
T Curran
T Hua
TA Salthouse
V Kolev
WJ Gehring
Y Wang
Y Wang
Y Wang
Publication venue: 'Association for Research in Vision and Ophthalmology (ARVO)'
Publication date: 01/01/2009
Field of study

Background: In this study, we quantified age-related changes in the time-course of face processing by means of an innovative single-trial ERP approach. Unlike analyses used in previous studies, our approach does not rely on peak measurements and can provide a more sensitive measure of processing delays. Young and old adults (mean ages 22 and 70 years) performed a non-speeded discrimination task between two faces. The phase spectrum of these faces was manipulated parametrically to create pictures that ranged between pure noise (0% phase information) and the undistorted signal (100% phase information), with five intermediate steps. Results: Behavioural 75% correct thresholds were on average lower, and maximum accuracy was higher, in younger than older observers. ERPs from each subject were entered into a single-trial general linear regression model to identify variations in neural activity statistically associated with changes in image structure. The earliest age-related ERP differences occurred in the time window of the N170. Older observers had a significantly stronger N170 in response to noise, but this age difference decreased with increasing phase information. Overall, manipulating image phase information had a greater effect on ERPs from younger observers, which was quantified using a hierarchical modelling approach. Importantly, visual activity was modulated by the same stimulus parameters in younger and older subjects. The fit of the model, indexed by R2, was computed at multiple post-stimulus time points. The time-course of the R2 function showed a significantly slower processing in older observers starting around 120 ms after stimulus onset. This age-related delay increased over time to reach a maximum around 190 ms, at which latency younger observers had around 50 ms time lead over older observers. Conclusion: Using a component-free ERP analysis that provides a precise timing of the visual system sensitivity to image structure, the current study demonstrates that older observers accumulate face information more slowly than younger subjects. Additionally, the N170 appears to be less face-sensitive in older observers

Springer - Publisher Connector

Edinburgh Research Explorer

Enlighten

Fully Trainable and Interpretable Non-Local Sparse Models for Image Restoration

Author: A Beck
A Foi
B Gunturk
BA Olshausen
C Bertocchi
C Dong
F Kokkinos
J Mairal
J Mairal
J Mairal
J Portilla
K Dabov
K Ma
K Zhang
K Zhang
LI Rudin
M Aharon
M Elad
M Gharbi
MAT Figueiredo
P Perona
R Jenatton
S Mallat
W Dong
X Liu
Y Chen
Y Romano
Publication venue
Publication date: 20/08/2020
Field of study

Non-local self-similarity and sparsity principles have proven to be powerful priors for natural image modeling. We propose a novel differentiable relaxation of joint sparsity that exploits both principles and leads to a general framework for image restoration which is (1) trainable end to end, (2) fully interpretable, and (3) much more compact than competing deep learning architectures. We apply this approach to denoising, jpeg deblocking, and demosaicking, and show that, with as few as 100K parameters, its performance on several standard benchmarks is on par or better than state-of-the-art methods that may have an order of magnitude or more parameters.Comment: ECCV 202

arXiv.org e-Print Archive

Hal - Université Grenoble Alpes

INRIA a CCSD electronic archive server

1/f2 Characteristics and Isotropy in the Fourier Power Spectra of Visual Art, Cartoons, Comics, Mangas, and Different Categories of Photographs

Author: A Hyvärinen
A Torralba
A van der Schaaf
AB Lee
AC Danto
AM Martinez
AS Georghiades
B Spehar
BA Olshausen
BA Olshausen
BC Hansen
C Redies
C Redies
C Redies
Christoph Redies
D Fernandez
DJ Field
DJ Graham
DJ Graham
DJ Graham
DJ Tolhurst
DL Ruderman
DL Ruderman
E Burke
G Paul
GJ Burton
GT Fechner
I Kant
J Alvarez-Ramirez
JH van Hateren
Joachim Denzler
JR Mureika
K Pearson
M Turk
Mark W. Greenlee
Michael Koch
MW Beauvois
N Goodman
PC Mahalanobis
PO Hoyer
RF Voss
RG Bosworth
RP Taylor
S Zeki
W Kandinsky
WE Vinje
WS Geisler
Y Joye
Y Yu
Publication venue: Public Library of Science
Publication date: 01/08/2010
Field of study

Art images and natural scenes have in common that their radially averaged (1D) Fourier spectral power falls according to a power-law with increasing spatial frequency (1/f2 characteristics), which implies that the power spectra have scale-invariant properties. In the present study, we show that other categories of man-made images, cartoons and graphic novels (comics and mangas), have similar properties. Further on, we extend our investigations to 2D power spectra. In order to determine whether the Fourier power spectra of man-made images differed from those of other categories of images (photographs of natural scenes, objects, faces and plants and scientific illustrations), we analyzed their 2D power spectra by principal component analysis. Results indicated that the first fifteen principal components allowed a partial separation of the different image categories. The differences between the image categories were studied in more detail by analyzing whether the mean power and the slope of the power gradients from low to high spatial frequencies varied across orientations in the power spectra. Mean power was generally higher in cardinal orientations both in real-world photographs and artworks, with no systematic difference between the two types of images. However, the slope of the power gradients showed a lower degree of mean variability across spectral orientations (i.e., more isotropy) in art images, cartoons and graphic novels than in photographs of comparable subject matters. Taken together, these results indicate that art images, cartoons and graphic novels possess relatively uniform 1/f2 characteristics across all orientations. In conclusion, the man-made stimuli studied, which were presumably produced to evoke pleasant and/or enjoyable visual perception in human observers, form a subset of all images and share statistical properties in their Fourier power spectra. Whether these properties are necessary or sufficient to induce aesthetic perception remains to be investigated

Public Library of Science (PLOS)

Public Library of Science (PLOS)

Emergence of Visual Saliency from Natural Scenes via Context-Mediated Probability Distributions Coding

Author: A Hyvarinen
A Hyvarinen
A Olmos
A Torralba
AJ Bell
AM Treisman
B Julesz
BA Olshausen
BA Olshausen
BW Tatler
C Kayser
C Koch
D Field
D Gao
D Gao
D Gao
EP Simoncelli
EP Simoncelli
F Attneave
G Felsen
GC DeAngelis
H Barlow
HJ Seo
JH van Hateren
JH van Hateren
Jinhua Xu
JJ Atick
JM Wolf
Joe Z. Tsien
L Itti
L Itti
L Itti
L Itti
L Zhang
L Zhang
L Zhaoping
M Carandini
Matjaz Perc
MS Caywood
NC Rust
ND Bruce
NDB Bruce
O Le Meur
PO Hoyer
RP Rao
RPN Rao
T Wachtler
TD Albright
WE Vinje
WJ Ma
WS Geisler
X Chen
Y Karklin
Z Li
Zhiyong Yang
Publication venue: Public Library of Science
Publication date: 29/12/2010
Field of study

Visual saliency is the perceptual quality that makes some items in visual scenes stand out from their immediate contexts. Visual saliency plays important roles in natural vision in that saliency can direct eye movements, deploy attention, and facilitate tasks like object detection and scene understanding. A central unsolved issue is: What features should be encoded in the early visual cortex for detecting salient features in natural scenes? To explore this important issue, we propose a hypothesis that visual saliency is based on efficient encoding of the probability distributions (PDs) of visual variables in specific contexts in natural scenes, referred to as context-mediated PDs in natural scenes. In this concept, computational units in the model of the early visual system do not act as feature detectors but rather as estimators of the context-mediated PDs of a full range of visual variables in natural scenes, which directly give rise to a measure of visual saliency of any input stimulus. To test this hypothesis, we developed a model of the context-mediated PDs in natural scenes using a modified algorithm for independent component analysis (ICA) and derived a measure of visual saliency based on these PDs estimated from a set of natural scenes. We demonstrated that visual saliency based on the context-mediated PDs in natural scenes effectively predicts human gaze in free-viewing of both static and dynamic natural scenes. This study suggests that the computation based on the context-mediated PDs of visual variables in natural scenes may underlie the neural mechanism in the early visual cortex for detecting salient features in natural scenes